ABSTRACT This paper reviews the chemical and functional aspects of the posttranslational modification of proteins, which is achieved by the addition of various
groups to the side chains of the amino acid residues of proteins. It describes the main prosthetic groups and the interaction between these groups and the
apoenzyme during catalysis, using pyridoxal catalysis as an example. Much attention is paid to the role of posttranslational modification of proteins in
the regulation of biochemical processes in living organisms, and especially to the role of protein kinases and their respective phosphatases. Methylation and
acetylation reactions and their role in the "histone code," which regulates genome expression at the transcription level, are also reviewed. This paper also describes the
modification of proteins by large hydrophobic residues and their role in the function of membrane-associated proteins. Much attention is paid to the glycosylation
of proteins, which leads to the formation of glycoproteins. We also describe the main non-enzymatic protein modifications, such as glycation, homocysteinylation,
and deamidation of the amide residues of dicarboxylic amino acids.
tion, Fyn, Lck - non-receptor tyrosine kinases of the Src family, Ub - ubiquitin residue, ULP - ubiquitin-like protein, Ras, Rab, Rho - protein products of the
INTRODUCTION

Template biosynthesis of polypeptide chains on ribosomes 
most often does not immediately produce a fully functional 
protein. The newly formed polypeptide chain must undergo 
certain chemical modifications outside the ribosome. These 
modifications are most often driven by enzymes and take 
place after all the information supplied by the template RNA 
(mRNA) has been read, that is after mRNA translation: thus, 
these additional processes are called posttranslational modi- 
fications. 

Posttranslational protein modification processes can be 
divided into two main groups. The first group unites proteo- 
lytic processes, which are mainly cleavages of certain pep- 
tide bonds, resulting in the removal of some of the formed 
polypeptide fragments. The second group consists of the proc- 
esses that modify the side chains of the amino acid residues 
and usually do not interfere with the polypeptide backbone. 
The chemical nature and functions of these modifications are
diverse. Moreover, each type of modification is characteristic
of certain groups of amino acid residues. As a result of
these processes, the proteome of a cell or organism
contains several orders of magnitude more components than
there are genes encoding them. This paper
is a review of the second group of posttranslational protein 
modifications. 

There are four main groups of protein functions that require
posttranslational modification of amino acid residue side
chains. First, the functional activity of a large number of
proteins requires the presence of certain prosthetic groups
covalently bound to the polypeptide chain. These are most often
complex organic molecules which take a direct part in the
protein's activity. The transformation of inactive apoproteins
into enzymes is one of these modifications. Second, an
important group of posttranslational modifications regulates
biochemical processes by varying (sometimes switching on and
off) enzymatic activity. Third, a large group of modifications
are protein tags, which determine the intracellular localization
of proteins, including marking proteins for transport to the
proteasome, where they are degraded by proteolysis.
Finally, some posttranslational modifications directly or
indirectly influence the spatial structure of newly
synthesized proteins.

MODIFICATION OF PROTEINS BY ADDITION 
OF PROSTHETIC GROUPS 

In some cases, the last step in the biosynthesis of a functional 
protein is the covalent binding of a prosthetic group, which 
forms part of the active site [1, 2]. Table 1 shows the struc- 
tural formulas of side chain modification products after the 
covalent binding of certain cofactors to proteins, as well as 
the types of reactions in which the corresponding prosthetic 
groups take part. 

Most of the listed prosthetic groups remain covalently 
bound to the apoenzyme through the whole catalytic process. 

Table 1. The main prosthetic groups involved in biocatalytic reactions

Columns: coenzyme name; structure of the prosthetic group derivative;
class of enzymes and type of reaction involving the prosthetic group.
[Coenzyme names and structural formulas not recovered; rows are listed
in the original order.]

Carboxylases. E.C. 6.4.1.2; 6.4.1.3. Carboxylation. Transfer of a
single-carbon fragment (CO2) onto acetyl-CoA, propionyl-CoA, and
other organic molecules.

Acyltransferases. E.C. 2.3.1.12. Reduction-oxidation. Transfer of
carbon fragments onto CoA via reductive acylation of lipoamide
during oxidative decarboxylation of α-ketoacids.

Acyltransferases. E.C. 2.3.1.85. Transacylation. Transfer of an acyl
fragment from one enzyme of a multi-enzyme complex to another.

E.C. 2.6.1. Transamination of amino acids.

Oxidoreductases. E.C. 1.3.99.1. Reduction-oxidation. Oxidation of
the -CH2-CH2- group to trans-CH=CH-.

Fig. 1. A schematic representation of the first stage of the
transamination reaction catalyzed by aspartate aminotransferase

The only exceptions are pyridoxal enzymes, in which the protein
is demodified during catalysis: the bond between pyridoxal
phosphate and the lysine amino group of the apoenzyme is
converted into a bond between the coenzyme and the substrate
amino acid. A dynamic model of the reactions catalyzed by
transaminases was suggested by M.Y. Karpeisky and V.I. Ivanov
in 1969 [3]. Later [4], the authors suggested that the phosphate
and methyl groups of the coenzyme act as a sort of axis around
which pyridoxal can rotate, thus forming either enzyme-imine or
substrate-imine covalent compounds. X-ray analysis data
confirmed and refined the conclusion that pyridoxal phosphate
is bound at multiple points.

Aspartate aminotransferase (E.C. 2.6.1.1), which catalyzes
the transamination of oxaloacetate and glutamate, can be used
to illustrate the mechanism of action of pyridoxal enzymes
(Fig. 1).

The coenzyme of the transaminase is not present as a free
aldehyde; rather, it forms an internal aldimine with the amino
group of a lysine side chain (Lys-258). The enzyme-bound
imine ensures a high reaction rate compared with free
pyridoxal phosphate [2-4], because imines are more reactive
than aldehydes. The more basic nitrogen of an imine is
protonated much more efficiently than the oxygen atom of a
carbonyl group (Fig. 1, (3)). The resulting transfer of a proton
from the α-NH3+ group of the substrate to the aldimine nitrogen
atom of pyridoxal phosphate creates the required cationic form
of the coenzyme and, simultaneously, a deprotonated amino
acid (3). Moreover, the imine carbon is more electrophilic than
the carbonyl carbon, which means that it is more easily attacked
by the deprotonated amino group of the α-amino acid (Fig. 1,
(4)). The electrophilicity of this site is further increased by
the interaction of the heterocyclic nitrogen with an aspartate
residue of the enzyme (hydrogen bond with Asp-222). Thus, the
enzyme-imine intermediate promotes the rapid formation of a
transient bond between the substrate and the coenzyme.

The described example of pyridoxal catalysis illustrates 
the fact that the apoenzyme plays as important a role in ca- 
talysis as the prosthetic group; that is, the former cannot sim- 
ply be called a carrier of the catalytic group. This is also the 
case for other prosthetic groups. 

REGULATION OF ENZYME ACTIVITY BY PHOSPHORYLATION 

The central role in the reactions that rearrange intracellular
processes and eventually signal either cell division or cell
death is played by a large group of enzymes called protein
kinases (phosphotransferases, E.C. 2.7). These enzymes add
phosphate groups to the side chains of amino acids in various
proteins [5-12]. The γ-phosphate of ATP is the donor of the
phosphate group in all such reactions. Kinases are grouped
according to the amino acid to which they add the phosphate
into tyrosine kinases (E.C. 2.7.10.2) and serine/threonine
kinases (E.C. 2.7.11.1) [5]. In addition, histidine kinases are
often found in bacteria, plants, and fungi. The latter enzymes
function in a two-step signal transduction system [13]. The
phosphate residue, which is first attached to a histidine in the
enzyme itself, is then transferred onto an aspartate residue in
the target protein. Phosphorylation of the aspartate results in
further signal transduction [13]. Figure 2 shows the structures
of amino acid phosphorylation products in proteins [1].

Fig. 2. The structure of phosphorylated amino acid fragments

Concerted regulation of interactions in a multicellular or- 
ganism is achieved by the release of specialized molecules 
(hormones, cytokines, etc.) which activate a signaling cascade 
in target cells. In cases where the signal causes alterations 
in the expression level of certain genes, the final links in the 
signaling chains are transcription factors [14-18]. Target cells
can identify the signaling molecule amongst a multitude of 
others with the help of a receptor protein present on the tar- 
get cell. This protein receptor has a specific binding site for 
the appropriate signaling molecule. Some receptors are local- 
ized on the surface of the cellular membrane, while others 
are intracellular receptors and are localized in the cytoplasm 
or inside the nucleus. A schematic representation of the main 
stages of, for example, hormone signal transduction via mem- 
brane receptors is presented in Fig. 3. At some of these stages, 
the activity of enzymes is regulated by phosphorylation. 

Membrane receptors can be divided into three functionally 
distinct structural regions. The first domain (the recognition 
domain) is situated in the N-terminal region of the polypep- 
tide chain and is located on the outside of the cellular mem- 
brane. This region carries glycosylated sites and recognizes 
and binds the signaling molecule. The second domain is the 
transmembrane domain. In some receptors, which are coupled
to G-proteins, this domain consists of seven tightly packed
α-helical polypeptide regions. Another type of receptor has a
transmembrane domain that consists of a single α-helical
region. The third (cytoplasmic) domain creates a chemical signal
inside the cell, which couples the binding of a signal molecule 
(a ligand) to a specific intracellular signal. 

The cytoplasmic regions of a number of receptors, which
face the inner side of the membrane, exhibit tyrosine
kinase activity. For instance, the binding of the hormone
insulin to its membrane receptor, which is a tyrosine kinase
and has a phosphorylation site, causes autophosphorylation
and leads to phosphorylation of the receptor's substrates and
also of other proteins [10]. The epidermal growth factor
receptor (EGFR) belongs to a family of growth factor receptors
which bind protein ligands and also exhibit tyrosine kinase
activity [14]. After binding the appropriate ligand, the
receptor forms a dimer, five tyrosine residues are
autophosphorylated at the C-terminus of the receptor, and the
protein acquires intracellular tyrosine kinase activity. Further
EGFR activity is involved in the initiation of the signal
transduction cascade, which includes the activation of
mitogen-activated protein kinases, protein kinase B, and JNK
(Jun N-terminal kinase), or Stress-Activated Protein Kinase
(SAPK), of the so-called MAP-kinase family. This promotes DNA
synthesis and proliferation [11, 12, 18-20].

Fig. 3. The basic stages of signal transduction via protein
phosphorylation. IP3 - inositol trisphosphate, DAG - diacylglycerol

Cytoplasmic domains of other receptors (somatotropin,
prolactin, cytokines, etc.) do not exhibit tyrosine kinase
activity themselves but are instead associated with other
cytoplasmic protein kinases (the so-called "Janus kinases," or
JAK family kinases), which phosphorylate the receptors and thus
activate them [11, 18]. The defining feature of Janus kinases
among all the other mammalian tyrosine kinases is their tandem
kinase (JH1) and pseudokinase (JH2) domains. This tandem
arrangement is the origin of the name "Janus kinase": they are
the only mammalian tyrosine kinases with a pseudokinase domain,
and thus have two "faces," like the two-faced god Janus. The
pseudokinase domain, though very similar to kinase domains,
does not possess the residues responsible for
phosphotransferase activity. Apparently, the function
of this domain is the regulation of catalytic activity.

Binding of the signaling molecule by a receptor is thought 
to activate signaling via homo- and heterodimerization of the 
receptor subunits, which then bind to Janus kinases. This 
leads to autophosphorylation of the kinases and increases 
their catalytic activity. The activated Janus kinases phospho- 
rylate tyrosine residues in the subunits of the receptor, which 
allows the receptor to bind other proteins, for instance the 
Signal Transducer and Activator of Transcription proteins 
(STAT). These STAT proteins are then phosphorylated by 
the Janus kinases, form dimers, and are transported into the 
nucleus, where they bind specific DNA motifs, thus regulat- 
ing transcription (Fig. 4). 

Mitogen-activated kinases (MAPK, E.C. 2.7.11.24) respond 
to extracellular stimuli (mitogens) and regulate a range of 
cellular processes (gene expression, cell division, differentia- 
tion, and apoptosis) [11, 17-20]. This MAP signal cascade is 
conserved in eukaryotes, from yeast to mammals.

The activity of serine/threonine protein kinases is influenced
by a number of factors, for instance damage to DNA, and also
by a range of chemical signals, including cAMP, cGMP,
diacylglycerol, and Ca2+/calmodulin [5, 8, 21-24]. Protein
kinases of this type phosphorylate serine or threonine
residues in consensus sequences, which form a phosphoacceptor
site. This amino acid sequence in the substrate molecule
allows contact between the catalytic groove of the protein
kinase and the phosphoacceptor site, which makes the kinase
specific not for a single substrate but for a family of
proteins sharing the same consensus sequence. While the
catalytic domains of the protein kinases are highly conserved,
the recognition sites vary, which allows the recognition of
various substrates. Protein kinases A, B, C, and G,
calmodulin-dependent protein kinases, etc. are all regulated
by second messengers of hormone signals.
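
The notion that a kinase recognizes a consensus sequence rather than one particular substrate can be illustrated with a short motif scan. The following is only a sketch: the motif Arg-Arg-X-Ser/Thr, which is often cited for protein kinase A, is an assumption brought in for illustration and is not taken from this review, and the function name and both peptide sequences are hypothetical.

```python
import re

# Assumed consensus for illustration only: Arg-Arg-X-Ser/Thr.
# The lookahead lets overlapping occurrences of the motif be reported.
ASSUMED_MOTIF = re.compile(r"(?=(RR.[ST]))")


def find_phosphoacceptor_sites(sequence: str) -> list[int]:
    """Return 1-based positions of the Ser/Thr residue in each motif match."""
    # The acceptor residue is the fourth residue of the four-residue motif.
    return [m.start() + 4 for m in ASSUMED_MOTIF.finditer(sequence.upper())]


# Two hypothetical, unrelated peptides that share the assumed consensus.
substrates = {
    "peptide_a": "MKLRRASLGAA",
    "peptide_b": "GGRRGTVAQRRPSW",
}
for name, seq in substrates.items():
    print(name, find_phosphoacceptor_sites(seq))
```

Both peptides are reported as candidate substrates although their sequences differ outside the motif, which is the sense in which specificity is directed at a family of proteins.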

Phosphorylation can take place not only at a single site in
the protein molecule, but also at multiple sites, involving
the functional groups of various amino acid residues [25-28].
Multiple phosphorylation is characteristic of several enzymes,
for instance eukaryotic RNA polymerase II (E.C. 2.7.7.6) [28].
The C-terminus of this enzyme's major subunit carries a large
number (52 in mammals, 26-27 in yeast) of repeated
heptapeptide consensus sequences
(Tyr-Ser-Pro-Thr-Ser-Pro-Ser). Multiple phosphorylation of
these repeats at the serine and threonine residues enhances
the binding of a large number of transcription elongation
factors and their associated proteins. This is a vital step in
the conversion of the transcription preinitiation complex into
a stable elongation complex [29], which allows
the RNA polymerase to move along the chromatin DNA.
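
To give a sense of scale for this multiple phosphorylation, the number of serine and threonine residues in the repeat region can be counted directly. The sketch below assumes an idealized C-terminal domain made only of perfect copies of the consensus heptapeptide (real repeats may deviate from the consensus), and the function name is hypothetical.

```python
# Consensus repeat Tyr-Ser-Pro-Thr-Ser-Pro-Ser in one-letter code.
HEPTAD = "YSPTSPS"


def count_ser_thr_sites(n_repeats: int) -> int:
    """Count Ser and Thr residues in an idealized domain of perfect repeats."""
    domain = HEPTAD * n_repeats
    return domain.count("S") + domain.count("T")


# Repeat numbers from the text; 26 is used as the lower yeast value.
for organism, repeats in {"mammals": 52, "yeast": 26}.items():
    print(organism, count_ser_thr_sites(repeats))
```

Under this simplification, each repeat contributes four serine or threonine residues, so 52 repeats offer 208 potential sites.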

PROTEIN ACETYLATION 

One of the most widespread types of posttranslational
modification that plays an important role in living organisms
is acetylation [30-38]. The reaction takes place at the ε-amino
groups of lysine residues, and acetyl coenzyme A acts as the
donor of acetyl groups. The positive charge of the amino group
disappears after this reaction, causing a redistribution of
charge in the whole protein molecule and increasing the
hydrophobicity and size of the modified amino acid's side chain.
In histones, among other proteins, this serves as a binding
signal for transcription factors and associated proteins, i.e.
for transcription initiation. A very important feature of the
proteins that recognize acetylated lysine residues is the
so-called bromodomain, a conserved module of about 110
amino acids [30, 31].

The acetylation process has been well studied on histone 
proteins [32-38]. Selective acetylation of several lysine resi- 
dues creates specific chromatin affinity towards certain tran- 
scription factors, which predetermines which genes will be 
expressed. This is why the distribution of acetylation sites 
between histones and among their amino acid residues is an 
important factor in the regulation of chromatin expression 
and is usually considered as one of the elements of the "his- 
tone code," which governs the above-mentioned process. In 
general, the "histone code" includes the whole range of amino 
acid modifications in the N- and C-terminal sequences of his- 
tones (phosphorylation, acetylation, methylation, and ADP- 
ribosylation), which determines the functional status of the 
gene with respect to replication and transcription [33-38]. 

Various forms of histone acetyltransferase (E.C. 2.3.1.48)
catalyze the acetylation of lysine residues located at specific
positions in the protein molecule. For instance, the octamer
core of a nucleosome, which consists of two copies each of the
H2A, H2B, H3, and H4 histones, contains 30 conserved lysine
residues available for acetylation in the N-terminal domains
of the proteins (residues in positions 5 and 9 in H2A; residues
5, 12, 15 and 20 in H2B; residues 9, 14, 18, 23 and 27 in H3; and
residues 5, 8, 12 and 16 in H4) [39]. Since the number and
positions of the modified residues can vary, a multitude of
combinations of acetylated residues is possible, which plays
an important role in chromatin function. For instance,
acetylation of Lys-18 in histone H3 of the yeast Saccharomyces
cerevisiae is the main indicator of active chromatin
transcription. This modified residue binds the largest number
of transcription factors. Activation of β-interferon genes in
humans requires acetylation of Lys-8 in the H4 histone and
Lys-14 in the H3 histone [39].
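
The figure of 30 lysine residues and the "multitude of combinations" can be checked with simple arithmetic. The sketch below uses the positions listed above and makes one simplifying assumption that is not stated in the text: every site is treated as independently either acetylated or not.

```python
# Acetylatable lysine positions in the N-terminal domains, as listed above.
ACETYL_SITES = {
    "H2A": [5, 9],
    "H2B": [5, 12, 15, 20],
    "H3": [9, 14, 18, 23, 27],
    "H4": [5, 8, 12, 16],
}
COPIES_PER_OCTAMER = 2  # two copies of each core histone per nucleosome

sites_per_octamer = COPIES_PER_OCTAMER * sum(
    len(positions) for positions in ACETYL_SITES.values()
)
# Assumption: each site is either acetylated or not, independently of the rest.
possible_patterns = 2 ** sites_per_octamer

print(sites_per_octamer)   # 30
print(possible_patterns)   # 1073741824
```

Even under this crude model, a single nucleosome core admits about a billion distinct acetylation patterns.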

It was discovered that acetylation of lysine residues in the
C-terminal domains of proteins protects the protein from
modification by ubiquitin, thus increasing the lifespan and
active functioning time of this protein [40].

ACYLATION OF PROTEINS BY HIGHER FATTY ACID RESIDUES 

The most widespread modifications by addition of fatty acid
residues are myristoylation, which is the addition of a
CH3-(CH2)12-CO- residue to the amino group of an N-terminal
glycine [1, 41, 42], and palmitoylation, which is the addition
of a CH3-(CH2)14-CO- residue at the SH-group of a cysteine
residue [1, 43, 44]. In both cases, the acyl donor is the
appropriate acyl coenzyme A, which is produced during
oxidative degradation of longer fatty acids.
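
The change in protein mass caused by each acylation follows from the residue formulas given above. The sketch below is a worked calculation under stated assumptions: it uses standard monoisotopic atomic masses, the function names are hypothetical, and the net shift is taken as the acyl residue minus the one hydrogen atom displaced from the modified amino or SH group.

```python
# Standard monoisotopic atomic masses in daltons.
ATOMIC_MASS = {"C": 12.0, "H": 1.00782503, "O": 15.99491462}


def formula_mass(formula: dict[str, int]) -> float:
    """Sum atomic masses for an elemental composition, e.g. {"C": 14, ...}."""
    return sum(ATOMIC_MASS[element] * n for element, n in formula.items())


def acyl_residue(n_methylene: int) -> dict[str, int]:
    """Elemental composition of the residue CH3-(CH2)n-CO-."""
    return {"C": n_methylene + 2, "H": 3 + 2 * n_methylene, "O": 1}


def acylation_mass_shift(n_methylene: int) -> float:
    """Net mass added to the protein: acyl residue minus the displaced H."""
    return formula_mass(acyl_residue(n_methylene)) - ATOMIC_MASS["H"]


print(f"myristoylation: +{acylation_mass_shift(12):.3f} Da")
print(f"palmitoylation: +{acylation_mass_shift(14):.3f} Da")
```

The two shifts, roughly 210.198 Da and 238.230 Da, differ by two methylene groups.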

An N-terminal glycine residue [42, 45] appears in proteins
after the N-terminal methionine residue, used to signal the
start of translation, is cleaved off. Addition of the myristoyl
group is catalyzed by myristoyl-CoA:protein
N-myristoyltransferase (E.C. 2.3.1.97) [46, 47]. The formation
of an amide bond between glycine and myristate is an
irreversible process. Introduction of the myristoyl residue
alters the lipophilic properties of the protein molecule and
promotes weak and reversible interactions of the protein with
phospholipid membranes or hydrophobic domains of other
proteins. Such interactions are vital for cell signaling,
apoptosis, and extracellular protein transport. Protein kinase A
and Gag, one of the main structural proteins of HIV, are
examples of myristoylated proteins [45, 48]. Usually,
modification by myristic acid acts in conjunction with other
protein regulatory mechanisms.

Often, myristoylation of the N-terminal glycine is followed 
by addition of a palmitic acid residue to a cysteine residue, 
thus forming a thioester bond [1, 43, 45, 49]. Unlike myris- 
toylation, this modification is reversible: there are several en- 
zymatic mechanisms that catalyze palmitoylation of cysteine 
residues, as well as their depalmitoylation [50]. 

Introducing a palmitic acid residue has the same result
as glycine modification by myristate: the lipophilicity
of the protein molecule increases. This enhances interactions
with membranes and promotes transport through them, while
the possibility of the reverse depalmitoylation reaction
allows the regulation of protein activity at various stages
of the cell cycle and cell signaling. Palmitoylation is
usually seen in proteins that participate in signaling:
G-proteins (small G-proteins from the Ras family, the α-subunit
of heterotrimeric G-proteins) and non-receptor tyrosine kinases
of the Src family (Fyn, Lck) [43, 45, 47, 51].

PROTEIN UBIQUITINYLATION

Acylation of proteins by the activated C-terminal carboxyl 
group of glycine in ubiquitin, an 8 kDa peptide consisting
of 76 amino acid residues, is of great biological importance 
[52-59]. The main, although not the only, purpose of this 
reaction is the marking of proteins for degradation. These 
include various damaged proteins, as well as ordinary pro- 
teins which fulfill their functions in certain phases of the 
cell cycle and whose activity is unfavorable during other 
phases. 

Conjugation of the target protein and ubiquitin is a
three-stage process. The first stage is the activation of the
carboxyl group of ubiquitin, performed by the
ubiquitin-activating enzyme E1 using ATP, thus forming
ubiquitinyl-AMP. The second stage is the transfer of the
ubiquitin residue onto the SH-group of the
ubiquitin-transporting protein E2. In the third stage, the
ubiquitin-protein ligase E3 catalyzes the transfer of the
ubiquitinyl residue onto the protein substrate, forming an
amide bond between the C-terminal glycine of ubiquitin (G76)
and a lysine residue in the target protein (substrate). A
protein modified in this way is a target for proteolysis in
proteasomes or lysosomes [57].

Whereas E1 is the only such enzyme in the cell, E2 has
20-40 isoforms, and the E3 enzyme has hundreds of isoenzymes,
which differ in their protein substrates. Preliminary
modification of the target protein is often needed for the E3
enzyme to recognize its substrate: phosphorylation (Ser/Thr,
Tyr), hydroxylation (Pro), glycosylation (Asn), or N-terminal
aminoacylation [54].

The target protein molecule can be modified by one or
several molecules of ubiquitin. The scheme (Fig. 5) denotes
such a product as substrate-Ubn. Polyubiquitinylation of the
substrate involves the acylation of a lysine residue (Lys-29,
Lys-48 or Lys-63) of the ubiquitin fragment already bonded to
the target protein by the C-terminal glycine residue of
another ubiquitin molecule [53, 60-63]. The formation of the
ubiquitin-protein covalent adduct does not interfere with the
conjugation of the above-named lysine residues with another
ubiquitin; thus, this process eventually leads to
polyubiquitinylation of the substrate protein (Fig. 6).

The degree to which the conjugate has been ubiquitinylated
defines its biological function. Thus, effective proteasomal
degradation of proteins requires tetraubiquitinylation at
Lys29 or Lys48, depending on the target protein. Misfolded
proteins and the majority of short-lived proteins form tandem
chains of ubiquitin residues connected by bonds at Lys48 [59].
Monoubiquitinylation usually takes place on random multiple
lysine residues in the target protein. This happens during the
metaphase-anaphase transition in mitosis, when metaphase
proteins need to be "switched off." Monoubiquitinylation of
human H2B is required for the methylation of histone H3, which
in turn is very important for chromatin remodeling and for the
transcriptional activation of "silent genes" [35]. Tandems of
several ubiquitin residues connected via Lys63 and bonded to
PCNA (Proliferating Cell Nuclear Antigen) play an important
role in postreplicative DNA repair [59, 61].
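
The dependence of function on chain length and linkage can be summarized as a small lookup. This is a rough sketch of the examples given in this section only, not a general rule set: the function name is hypothetical, and reading "tetraubiquitinylation" as "at least four residues" is an assumption.

```python
from typing import Optional


def ubiquitin_signal(n_ubiquitin: int, linkage: Optional[str] = None) -> str:
    """Map a ubiquitin modification to the role described in this section."""
    if n_ubiquitin < 1:
        raise ValueError("at least one ubiquitin residue is required")
    if n_ubiquitin == 1:
        return "monoubiquitinylation: regulatory signal (e.g. histone H2B)"
    if linkage in ("Lys29", "Lys48"):
        # Assumption: a chain of four or more residues is sufficient.
        if n_ubiquitin >= 4:
            return "proteasomal degradation"
        return "chain shorter than four residues: degradation not effective"
    if linkage == "Lys63":
        return "non-degradative signal (e.g. postreplicative DNA repair)"
    return "not covered by the examples in this section"


print(ubiquitin_signal(4, "Lys48"))
print(ubiquitin_signal(3, "Lys63"))
print(ubiquitin_signal(1))
```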

Currently, several ubiquitin-like proteins (ULP) are known,
and they are all grouped into the ubiquitin family, which
includes ubiquitin itself, Nedd8, SUMO, Fat10, ISG15, Urm1,
Hub1, etc. [53, 56-59, 62, 64]. These proteins are homologous
to ubiquitin in their amino acid sequence to varying degrees
and share a similar spatial structure. The large number of ULP
in cells indicates their involvement in a wide range of
cellular processes. Thus, SUMO is involved in nuclear transport,
transcription regulation and chromosome segregation; ISG15
is part of the immune response cascade; Nedd8 is involved in
the meiosis-mitosis switch; and Urm1 is implicated in cellular
growth at elevated temperatures [59].

Chaperones interact with newly synthesized, misfolded 
polypeptides and act as cofactors for ubiquitinylation en- 
zymes, since they possess an ubiquitin-recognition domain. 
After the target protein has been tagged by ubiquitin, the 
chaperones escort the ubiquitinylated protein into the pro- 
teasome, where they dissociate from the protein complex. 
The ubiquitin chains are removed, and the target protein is
denatured via an ATP-dependent process and then broken
down into short peptides by proteases.

PROTEIN ALKYLATION 

Another frequently observed posttranslational modification is
alkylation. This type of modification includes the methylation
of lysine and arginine residues [26, 30, 33-39, 65-72] and
prenylation (addition of farnesyl and geranyl-geranyl moieties
to cysteine side chains) [47, 73-80] (Fig. 7).

Protein methylation in living organisms is catalyzed by
methyltransferases [1, 65, 67] and involves the transfer of a
CH3 group from S-adenosylmethionine according to the
depicted reaction (Fig. 8).

Fig. 9. Demethylation reaction of di- and monomethylated lysine
residues in histones catalyzed by the FAD-dependent amine oxidase
(top), and of tri-, di- and monomethylated lysine residues in
histones catalyzed by histone demethylase, which functions in the
presence of the cofactors Fe2+ ions, α-ketoglutarate and ascorbate
(bottom)

Lysine can form mono-, di- and trimethyllysines in
methyltransferase-catalyzed reactions, while arginine can form
mono- and dimethylarginines [65]. These compounds differ
in size and hydrophobicity from the original residue.

The mechanisms of protein methylation have been best studied
in histone modification. Histone methyltransferases are
highly specific for the nature of the amino acid residue
(histone-lysine methyltransferases (E.C. 2.1.1.43) and
histone-arginine methyltransferases (E.C. 2.1.1.125)) and for
the position of this residue in the polypeptide chain [1, 65].
Lysine residue methylation in histones is a very important
element of the aforementioned "histone code" [33-36, 38]. The
best characterized methylation positions in histones are Lys4
and Lys9 in the H3 histone. Besides these residues, Lys27,
Lys36, Arg2, Arg17 and Arg26 in H3 can also be modified,
as well as Arg3 in the H4 histone [33, 34, 67, 70].
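
Because each lysine or arginine can carry a different number of methyl groups, methylation is a multi-state mark rather than an on/off one. The sketch below counts the patterns available to the H3 sites named above, assuming (as a simplification not made in the text) that every site varies independently and that isomeric forms of dimethylarginine are not distinguished; the function name is hypothetical.

```python
from math import prod

# Number of methylation states per residue type, from the text:
# lysine: unmodified, mono-, di-, tri-; arginine: unmodified, mono-, di-.
STATES = {"Lys": 4, "Arg": 3}

# Histone H3 positions named in the text as methylation sites.
H3_SITES = ["Lys4", "Lys9", "Lys27", "Lys36", "Arg2", "Arg17", "Arg26"]


def methylation_patterns(sites: list[str]) -> int:
    """Number of distinct patterns if every site varies independently."""
    # The first three letters of each site name give the residue type.
    return prod(STATES[site[:3]] for site in sites)


print(methylation_patterns(H3_SITES))  # 4**4 * 3**3 = 6912
```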



It was demonstrated that trimethylated Lys4 in the
H3 histone is necessary for transcription activation, while
dimethylated Lys4 is found in both active and silent genes
[33, 34, 70]. The heterochromatin protein 1 (HP1) interacts
with the trimethylated Lys9 of H3 via its chromodomain (a 
recognition domain for alkylated amino acid residues), initi- 
ates local chromatin condensation, and recruits other protein 
factors into the assembly of an active transcription complex 
[26, 30, 33, 67, 70]. 

Until recently, it was thought that the methylation of
lysine residues was an irreversible process [1]. Not long ago,
however, researchers managed to isolate enzymes that catalyze
the cleavage of methyl groups from lysine and arginine
residues, which means that this type of posttranslational
modification is also dynamic. Demethylation of lysine is an
oxidative process and can be catalyzed either by the
FAD-dependent polyamine oxidase or by a lysine-specific
demethylase, which functions as a dioxygenase in the presence
of cofactors such as Fe2+ ions, α-ketoglutarate, and ascorbate
(E.C. 1.5.3.4) [37, 65, 66, 82, 83]. For a schematic
representation of this process, see Fig. 9.

36 | ACTA NATURAE | № 3 2009

REVIEWS

[Reaction scheme: methylated arginine residue + H2O → citrulline residue]

Fig. 10. Demethylation of modified arginine residues catalyzed by the
nuclear peptidylarginine deiminase (PAD4) [58]

A nuclear peptidylarginine deiminase (E.C. 3.5.3.15) can 
demethylate arginine residues, turning methylated arginine 
into citrulline [66] (Fig. 10). 

Thus, methylation-demethylation and acetylation- 
deacetylation of specific residues in histones are major factors 
in gene repression and activation. 

PROTEIN PRENYLATION 

Some posttranslational modifications involve the addition
of isoprenoid moieties to a cysteine residue. These moieties,
farnesyl and geranyl-geranyl, are built from isoprene units
(Fig. 11). Modification of proteins with these groups is
catalyzed by protein farnesyltransferase and protein
geranyl-geranyl transferases, respectively (E.C. 2.5.1.58 and
E.C. 2.5.1.59 or E.C. 2.5.1.60; Type I and II geranyl-geranyl
transferases). Type I enzymes catalyze the transfer of a
geranyl-geranyl residue onto a cysteine residue in a
Cys-A-A-X sequence, while type II enzymes use the Cys-Cys-X-X,
X-X-Cys-Cys or X-Cys-X-Cys sequences [47, 73-80], where A is
a small aliphatic amino acid and X is any of various amino
acids.

Ras-, Rab-, and Rho-family proteins (products of the ras,
rab, and rho proto-oncogenes, involved in cellular growth and
differentiation), centromeric proteins, γ-subunits of
heterotrimeric G-proteins, chaperones, and tyrosine phosphatases
are all subject to prenylation [47, 73, 75, 78, 79, 81]. The
C-terminal sequence of Ras-family proteins includes a Cys-A-A-X
motif, in which X is the amino acid that determines the enzyme
specificity: Leu, Phe, or Met in the case of the type I
geranyl-geranyl transferase, and Ala, Gln, Ser, Met, or Phe in
the case of the farnesyltransferase [47, 74, 78, 79]. The enzymes
that transfer the isoprenyl residues are metalloenzymes that
carry a single Zn2+ ion per dimeric enzyme molecule. The zinc ion
activates the cysteine thiol group for nucleophilic attack on the
isoprenyl moiety [73]. The addition of the isoprenyl group to the
Cys-A-A-X motif is usually not the last modification of the
target protein (Ras, Rho). Further processing involves
proteolytic cleavage of the A-A-X tripeptide from the C-terminus
by a Cys-A-A-X-specific protease, followed by carboxymethylation
of the isoprenylcysteine residue by the isoprenylcysteine
carboxymethyltransferase (E.C. 2.1.1.100) [84-87] (Fig. 12).
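The motif rules given in the text can be written down as a small lookup. The sketch below is only an illustration of those rules, not a validated predictor: the function name is hypothetical, and the set of residues counted as "small aliphatic" (Ala, Val, Leu, Ile) is an assumption, since the text does not list them.

```python
# Illustrative sketch: the X-residue sets are transcribed from the text;
# the "small aliphatic" set is an assumption made for this example.
SMALL_ALIPHATIC = set("AVLI")   # assumed meaning of "A" in Cys-A-A-X
FTASE_X = set("AQSMF")          # Ala, Gln, Ser, Met, Phe
GGTASE1_X = set("LFM")          # Leu, Phe, Met


def prenylation_candidates(sequence: str) -> set[str]:
    """Return the transferases whose C-terminal motif rule matches."""
    tail = sequence.upper()[-4:]
    if len(tail) < 4:
        return set()
    candidates = set()
    c, a1, a2, x = tail
    # Cys-A-A-X: the last residue X selects the enzyme.
    if c == "C" and a1 in SMALL_ALIPHATIC and a2 in SMALL_ALIPHATIC:
        if x in FTASE_X:
            candidates.add("farnesyltransferase")
        if x in GGTASE1_X:
            candidates.add("geranyl-geranyl transferase I")
    # Type II motifs: Cys-Cys-X-X, X-X-Cys-Cys, X-Cys-X-Cys.
    if tail[:2] == "CC" or tail[2:] == "CC" or (tail[1] == "C" and tail[3] == "C"):
        candidates.add("geranyl-geranyl transferase II")
    return candidates
```

Because Met and Phe appear in both X-residue lists, a tail such as CVIM matches both Cys-A-A-X enzymes under these rules; the one-letter sequences used to call the function are arbitrary test strings, not real protein sequences.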

GTPases of the Rab family carry a Cys-Cys-X-X motif 
near the C-terminus. Both these cysteines can be modified 
by geranyl-geranyl residues with the help of type II protein 
geranyl-geranyl transferase, which creates two lipid anchors 
on the protein molecule [74, 75]. Such a protein exhibits in- 
creased affinity towards lipid membranes, and it can thus 
act as a unique recognition site for specific protein-protein 
interactions. 

Proteins of the Rab family are involved in intracellular 
vesicle transport circulating between the cellular membrane 
and the cytosol. Reversible association of the protein with the 
cellular membrane is achieved through the isoprenyl residues 
decorating these proteins [75, 84]. 

Since 20-30% of all human oncological conditions are 
caused by mutations in Ras family proteins, enzymes that 
modify these proteins with isoprenyl residues can serve as 
targets for anti-tumor drugs [73, 79]. 

PROTEIN GLYCOSYLATION 

Glycosylation of proteins plays a very important role in the
functioning of eukaryotic cells. Glycosylation modifies the
OH-groups of serine and threonine residues (O-glycosylation)
and the amide groups of asparagine residue side chains
(N-glycosylation) (Fig. 13).

N-glycosylation of proteins takes place at the carboxamide
nitrogen atom of an asparagine residue in the sequence context
Asn-X-Ser/Thr. N-glycoside formation begins in the endoplasmic
reticulum. The oligosaccharyl transferase enzyme (E.C. 2.4.1.119)
transfers a branched tetradecasaccharide fragment onto the
target protein. This fragment is Glc3Man9(GlcNAc)2, and it
comes from the carbohydrate donor molecule dolichol
pyrophosphate.

Fig. 12. Prenylation of the Ras protein: (1) addition of a farnesyl
residue onto the Cys-A-A-X sequence (A is a small aliphatic amino
acid residue; X is Leu, Phe, or Met); (2) cleavage of the A-A-X
tripeptide by the Ras-converting enzyme, which is a CysAAX
endopeptidase; (3) carboxymethylation of the isoprenylcysteine
residue catalyzed by the isoprenylcysteine carboxymethyltransferase
[86]

Fig. 13. Structures of the products of N-acetylglucosamine addition onto
serine and asparagine side chains in proteins
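The Asn-X-Ser/Thr context can be located in a sequence with a short pattern search. The following is a minimal sketch with a hypothetical function name. The option to skip sequons in which X is proline reflects a commonly reported restriction that the text itself does not state, so it is exposed as a switch and labeled as an assumption.

```python
import re


def find_sequons(sequence: str, exclude_proline: bool = True) -> list[int]:
    """Return 1-based positions of Asn in Asn-X-Ser/Thr contexts.

    exclude_proline=True applies the additional assumption (not stated
    in the text) that X must not be proline.
    """
    middle = "[^P]" if exclude_proline else "."
    # The lookahead lets overlapping sequons (e.g. N-N-T-S) all be reported.
    pattern = re.compile(f"(?=N{middle}[ST])")
    return [match.start() + 1 for match in pattern.finditer(sequence.upper())]
```

A matching context is only a candidate site: whether it is actually glycosylated depends on the factors described in the text, such as access to the oligosaccharyl transferase in the endoplasmic reticulum.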

The vast variety of glycoproteins is assured by the process- 
ing of the protein-bound tetradecasaccharide residue, which 
is accomplished by a set of glycosidases and glycosyl trans- 
ferases. 

Figure 15 presents the structure of a bound tetradecasaccharide
and the products of the first stages of processing, which are
catalyzed by glucosidases I and II (E.C. 3.2.1.106), which cleave
away two glucose residues, and mannosidases (E.C. 3.2.1.130),
which cleave away six mannose residues. The glycoprotein formed
after removal of the two glucose residues, which thus bears an
N-bound dodecasaccharide residue, is then recognized by the
chaperones calnexin and calreticulin, which facilitate correct
folding of the protein while it is being transported from the
site of synthesis on the membrane-bound ribosomes to the inside
of the endoplasmic reticulum [1, 88, 89, 90-93]. After the third
glucose residue is cleaved away by an endoplasmic reticulum
glucosidase, the chaperones lose their affinity towards the
undecasaccharide and dissociate from the glycoprotein.
UDP-glucose:glycoprotein glucosyltransferase (E.C. 2.7.8.19)
returns a glucose residue onto the undecasaccharide, which
allows calnexin and calreticulin to rebind and continue folding
the glycoprotein. This mechanism maintains the functional
structure of secreted glycoproteins.

If a glycoprotein is not folded correctly after several
rounds of deglycosylation-reglycosylation, it is transported
into the cytosol. There, it is polyubiquitylated by an E3 ligase
that is part of the degradation system for proteins misfolded in
the endoplasmic reticulum, and it is then hydrolyzed in the
proteasomes [1, 88, 89, 90-94].

The correctly folded Man9(GlcNAc)2 N-glycoprotein loses six
mannose residues with the help of endoplasmic reticulum
and Golgi apparatus mannosidases and forms a protein
conjugated with a core pentasaccharide, Man3(GlcNAc)2. The
latter can receive various monosaccharides with the help of
a number of glycosyl transferases, of which there are a great
many in the endoplasmic reticulum and the Golgi apparatus.
Thus, the variety of glycoproteins numbers in the tens of
thousands [1, 88, 89, 95].
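The residue counts behind the names tetradeca-, dodeca-, undeca-, and pentasaccharide can be checked with simple bookkeeping. The sketch below only tracks monosaccharide composition through the trimming steps described above; the helper names are hypothetical, and branching and linkage positions are ignored.

```python
from collections import Counter


def trim(glycan: Counter, residue: str, count: int) -> Counter:
    """Return a copy of the glycan with `count` residues of one type removed."""
    if glycan[residue] < count:
        raise ValueError(f"cannot remove {count} {residue} residues")
    trimmed = glycan.copy()
    trimmed[residue] -= count
    return trimmed


def size(glycan: Counter) -> int:
    """Total number of monosaccharide residues in the glycan."""
    return sum(glycan.values())


# Glc3Man9(GlcNAc)2, transferred from dolichol pyrophosphate.
precursor = Counter({"Glc": 3, "Man": 9, "GlcNAc": 2})
# Glucosidases I and II remove two glucose residues.
after_glucosidases = trim(precursor, "Glc", 2)
# The third glucose is removed in the endoplasmic reticulum.
after_third_glucose = trim(after_glucosidases, "Glc", 1)
# Mannosidases remove six mannose residues, leaving Man3(GlcNAc)2.
core = trim(after_third_glucose, "Man", 6)

sizes = [size(g) for g in (precursor, after_glucosidases, after_third_glucose, core)]
```

Under this bookkeeping, the four stages contain 14, 12, 11, and 5 residues, matching the saccharide names used in the text.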

Glycoprotein O-glycoside chains are much shorter and simpler
than N-glycoside chains. Numerous proteins, including
transcription factors, nuclear pore proteins, and oncoproteins,
contain a monosaccharide residue of N-acetylglucosamine,
which is introduced into the protein by an O-GlcNAc transferase
(E.C. 2.4.1.94) and can be cleaved by the appropriate
hydrolase [1, 88, 89, 96, 97-100]. There are also O-glycosides
bearing di-, tri-, or tetraglycoside fragments.

Short O-glycoside chains in O-glycoproteins are important for
transcription activation, and they act as recognition sites during
interaction with cell membrane receptors, which are involved in
the transduction of signals into the cell [1, 88, 89, 100-102].

PROTEIN SULFATION 

Another posttranslational modification of protein molecules is 
the addition of a sulfate residue at the OH-group of tyrosine. 
Phosphoadenosylphosphosulfate acts as the sulfate donor (Fig.
16). The reaction is catalyzed by the sulfotransferase enzyme
(E.C. 2.8.2.20) [103, 104].



For instance, three tyrosine residues in the N-terminal
region of the human chemokine cell membrane receptor (a
regulator of anti-inflammatory immune reactions), which
plays an important role in embryo development and in the
immune response, are subject to posttranslational sulfation
in the Golgi apparatus. This increases the affinity of the
receptor towards its ligand, the SDF-1α chemokine. An enzyme
called sulfatase (E.C. 3.1.5.6), which catalyzes the hydrolysis
of sulfoesters, has been found in lysosomes [103, 105, 106].




Fig. 16. Sulfation reaction catalyzed by sulfotransferase

MONO- AND POLY(ADP-RIBOSYL)ATION

Many cellular processes, such as DNA repair, apoptosis,
and the functioning of the spindle during cell division, use
mono- and poly(ADP-ribosyl)ation as an important regulatory
mechanism [107]. Various pathogenic bacteria secrete
toxins that ADP-ribosylate human proteins, thus causing
severe diseases such as cholera, diphtheria, pertussis, and
botulism [108-111].

NAD+ acts as the donor of the ADP-ribosyl residue. The
bond to the positively charged nicotinamide is cleaved by the
ADP-ribosyltransferase (E.C. 2.4.2.31), forming a
ribo-oxocarbenium cation, which reacts with various nucleophilic
groups in protein active sites and leads to their
(ADP-ribosyl)ation (Fig. 17) [108, 109].

For instance, pertussis toxin transfers this cation
to the thiolate side chain of a cysteine residue in the active
site of the human Gi-protein α-subunit. This protein regulates
synthesis of the second messenger cAMP [1, 111, 112]. Cholera
toxin transfers an ADP-ribosyl residue onto an arginine
residue in the human Gs-protein α-subunit [1, 111, 113]. The
ADP-ribosyl residue can also be transferred by the C3 toxin
of Clostridium botulinum onto the nucleophilic Asn41 residue
of a small GTPase of the Rho protein superfamily, which
leads to actin depolymerization and impairment of the
metabolic processes of the host cell [1, 111].



Diphtheria toxin ADP-ribosylates His715 in the eEF-2 
elongation factor and, therefore, blocks the translocation of 
peptides on ribosomes and the whole translation process in 
human cells [114]. 

In reality, His715 undergoes a complex stepwise
modification: first, an aminocarboxypropyl residue is transferred
from S-adenosylmethionine (SAM); then SAM-dependent
N,N,N-trimethylation takes place; then the carboxyl group
is amidated in a glutamine-mediated fashion, thus forming
a diphthamide residue; and only then does the toxin
ADP-ribosylate the diphthamide residue at the N3 atom of the
imidazole ring (Fig. 18) [115-117].

Fig. 18. Modification of the His715 residue in the structure of the
human eEF-2 elongation factor results in the blocking of protein
synthesis in human cells

During the lifetime of the organism, the genome constantly
suffers the effects of genotoxic agents of both exogenous and
endogenous nature [118]. An approximate estimate showed
that every day the genomes of human cells experience up
to 10^4-10^6 instances of DNA damage [119]. Under these
circumstances, the stability of the cell genome is one of the most
important factors in maintaining the survival of a multicellular
organism, since any uncorrected damage to DNA can promote
the emergence of mutator cell phenotypes [120]. Poly(ADP-ribose)
(PAR) synthesis is one of the immediate reactions of
the cell in response to DNA breaks caused by ionizing
radiation or by alkylating or oxidizing agents [121, 122].
This process is catalyzed by poly(ADP-ribose) polymerases
(PARPs), which are constantly and abundantly expressed in the
cell [123]. PARPs are activated in response to
DNA breaks and catalyze the posttranslational modification
of a range of DNA-binding proteins by covalently adding a
poly(ADP-ribose) polymer to the carboxyl groups of glutamic
and aspartic acid residues in the acceptor proteins [124].
Currently, approximately 30 nuclear proteins that are
poly(ADP-ribosyl)ated in vivo and in vitro have been described
[123, 125]. All these proteins exhibit DNA-binding activity and
are involved in DNA metabolism (replication, transcription,
repair) or in chromatin formation (histones). Several enzymes of
the poly(ADP-ribose) polymerase class have been found in
eukaryotes, including PARP1, PARP2, and PARP3, which have
nuclear localization; tankyrases 1 and 2, which interact with
telomere proteins and are thought to regulate telomere function;
VPARP (193 kDa), found in cytoplasmic ribonucleoprotein
vault particles [126]; sPARP, a truncated form of PARP1
that does not require activation by DNA breaks [127]; and
macro PARPs (BAL/PARP-9, PARP14, PARP15), which are
involved in the epigenetic modification of chromatin [124,
128]. Ninety percent of nuclear poly(ADP-ribose) synthesis
is due to PARP1 activity [129]. This protein is expressed
at a constant level throughout the cell cycle, and each
cell carries around 1.0 x 10^6 molecules of the protein, which
amounts to one protein molecule per 6000 nucleotide pairs
[130]. Catalytically inactive PARP1 is present in the
nucleoplasm and is activated by DNA breaks. It then binds to the
damaged area and catalyzes PAR synthesis [128]. PARP
synthesizes poly(ADP-ribose) in three stages: initiation,
elongation, and branching of the polymer (Fig. 19).
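The two numerical statements in this paragraph can be put on a common scale with a little arithmetic. The sketch below is only a back-of-the-envelope check; the function names are hypothetical. Reading the result as the approximate size of a diploid human genome is an assumption of this example, not something the text states.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def lesions_per_second(lesions_per_day: float) -> float:
    """Convert a daily count of DNA damage events into an average rate."""
    return lesions_per_day / SECONDS_PER_DAY


def implied_genome_size(parp1_copies: float, bp_per_copy: float) -> float:
    """Genome size (in base pairs) implied by the stated PARP1 density."""
    return parp1_copies * bp_per_copy


# 10^4 to 10^6 damage events per day, as quoted in the text.
low_rate = lesions_per_second(1e4)
high_rate = lesions_per_second(1e6)

# 1.0 x 10^6 PARP1 molecules, one per 6000 nucleotide pairs.
genome_bp = implied_genome_size(1.0e6, 6000)
```

On these figures the damage rate averages from roughly one event every nine seconds to about a dozen events per second, and the PARP1 density corresponds to a genome of 6 x 10^9 base pairs.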

The first stage involves the formation of an ester bond
between the ADP-ribose and the carboxyl group of a glutamate
residue in the acceptor protein [131, 132]. The second
stage involves the formation of an O-glycoside bond between
the C2' and C1'' atoms of the ADP-ribose units, thus creating a
linear polymer of ADP-ribose molecules [133, 134]. In the third
stage, a glycoside bond links the C2'' and C1''' atoms of the
ADP-ribose units, forming branches in the polymer structure
[135, 136] (Fig. 19).

The rate of the chemical reaction at the mono(ADP-ribosyl)ation
stage is approximately 200 times lower than at the
elongation stage [137]. Based on the measurement of kinetic
parameters of the in vitro PARP-catalyzed poly(ADP-ribosyl)ation
reaction, the authors of [138] hypothesize that the
latter reaction is intermolecular, meaning that PARP1 functions
as a homodimer at the DNA break site. Two molecules
bind the DNA break at once, and during the reaction
both molecules simultaneously synthesize PAR and function
as acceptors. The covalent modification of PARP1 by the
addition of a charged poly(ADP-ribose) residue alters
the enzyme's physicochemical characteristics and causes its
dissociation from the DNA complex [139]. Thus, regulation of
PARP1 DNA-binding activity can be achieved through
self-modification [140].

The discovery of poly(ADP-ribosyl)ation of chromatin remodeling
proteins and histones in vivo and of topoisomerases in vitro
suggests that PARP1 is involved in chromatin remodeling during
DNA repair [123, 133, 141]. It was demonstrated that the kinetic
parameters of DNA repair reactions were influenced by the
presence of histones on the damaged DNA [123]. In vivo
poly(ADP-ribosyl)ation of the H1 histone and of the histones
forming the nucleosome core during DNA damage can play an
important role in DNA repair, especially if the DNA is packaged
as chromatin, since modification of the histones can lead to
their dissociation from the DNA molecule, thus giving the repair
enzymes easy access to the damaged site [123, 140].

Therefore, the current overall view is that the cell's
response to damaged DNA can be modulated by the activity
of PARP1. On the one hand, PARP1 activates repair processes,
thus promoting cell survival; on the other hand, when DNA
damage is irreparable and the emergence of a mutator
phenotype is highly probable, "overactivation" of PARP1 induces
cell death [142]. This is why the PAR synthesis catalyzed by
PARP during its interaction with DNA breaks can be regarded as a
signal of the DNA damage level, which is used to determine the
cell's future functional strategy.

OXIDATION OF THE SULFHYDRYL MOIETY
OF THE CYSTEINE RESIDUE IN PROTEINS

In a large number of proteins, disulfide bonds are formed
in a reaction between cysteine residues either within a single
polypeptide chain or between different polypeptide molecules.
Such bonds fulfill a structural function and determine the
tertiary and quaternary structure of the protein, which are
vital for the protein's metabolic functions in the organism.
This modification is also involved in the regulation of the
cell's reduction-oxidation status, which affects numerous
cellular processes, such as proliferation, differentiation, and
apoptosis, by changing the functioning of proteins via a
reversible modification of cysteine residues [143-147].

Oxidation of cysteine residues involves the following
processes: formation of a disulfide bond, formation of sulfinic
and sulfonic acids, and binding of glutathione [145]. A
disulfide bond is formed via the oxidation of the electron-rich
sulfhydryl moiety (or of the thiolate anion, which is generated
from the former after proton dissociation) of the cysteine
residue side chain. One-electron oxidation of the sulfhydryl
moiety leads to the formation of a thiyl radical, which can
dimerize into a disulfide [147].

Under physiological conditions, most of the sulfhydryl groups
are in the oxidized form and are thus involved in disulfide
bonds. Reduction of the disulfide bonds in vivo is accomplished
by the tripeptide glutathione, γ-Glu-Cys-Gly (GSH), which is
converted into oxidized glutathione (GSSG). High levels of
NAD(P)H and of the glutathione reductase (E.C. 1.8.1.7) and
thioredoxin reductase (E.C. 1.8.1.9) enzymes lead to the
reduction of oxidized glutathione [143-147] (Fig. 20). As
proteins move down the secretory pathways of eukaryotic cells,
the levels of glutathione and NAD(P)H decrease, which is why
most proteins exist in structures stabilized by disulfide
bonds [148].
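The link between falling glutathione levels and more stable disulfide bonds can be illustrated with the Nernst equation for the GSSG/2GSH couple. The sketch below rests on several assumptions that are not taken from the text: the standard potential of -0.240 V at pH 7 is a commonly cited literature value, the temperature is set to 37 °C, and the concentrations used to call the function are arbitrary illustrative numbers.

```python
import math

GAS_CONSTANT = 8.314462618      # J / (mol K)
FARADAY_CONSTANT = 96485.33212  # C / mol


def glutathione_potential(
    gsh_molar: float,
    gssg_molar: float,
    e0_volts: float = -0.240,     # assumed standard potential at pH 7
    temperature_k: float = 310.15,  # assumed temperature, 37 degrees C
) -> float:
    """Half-cell potential (V) of GSSG + 2 H+ + 2 e- -> 2 GSH."""
    if gsh_molar <= 0 or gssg_molar <= 0:
        raise ValueError("concentrations must be positive")
    # Two electrons are transferred, and GSH enters the quotient squared.
    slope = GAS_CONSTANT * temperature_k / (2 * FARADAY_CONSTANT)
    return e0_volts - slope * math.log(gsh_molar ** 2 / gssg_molar)


# Hypothetical compartments with the same GSSG level but different GSH levels.
reducing = glutathione_potential(gsh_molar=0.01, gssg_molar=0.0001)
oxidizing = glutathione_potential(gsh_molar=0.001, gssg_molar=0.0001)
```

Because GSH enters the expression squared, a tenfold drop in GSH at constant GSSG makes the potential markedly less negative, that is, more oxidizing, which is the direction in which disulfide bonds are favored.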

Oxidizing agents (hydrogen peroxide, hydroxyl radical) can
oxidize the cysteine sulfhydryl group into cysteine sulfenic
acid (-SOH) [147]. Interaction of the cysteine sulfenic acid
residue with the closest Cys-S- group also results in the
formation of a disulfide bond.

Reduction of the disulfide bond can result from thiol-disulfide
exchange with either glutathione or thioredoxin (TSH), a
low-molecular-weight (12 kDa) protein that contains catalytically
active sulfhydryl groups in its active center (Cys-Gly-Pro-Cys)
and plays the central role in regulating the reduction-oxidation
status of disulfide bonds in proteins, which in turn governs a
wide range of cellular processes. The oxidized forms of these
compounds are reduced by NAD(P)H and glutathione
reductase/thioredoxin reductase [146-149].

Both the thiolate ion and the thiyl radical can react with
other oxidizing agents and radicals (such as NO•) (Fig. 21). The
resulting CysSNO molecule is involved in oxidation signaling
in the cell [150-154].

HYDROXYLATION OF PROTEIN FUNCTIONAL GROUPS 

Another type of posttranslational modification is the
oxidative hydroxylation reaction. This reaction takes place at
non-nucleophilic amino acid residue side chains: the CH2-groups
of proline, lysine, and asparagine are converted into
3-hydroxyproline, 4-hydroxyproline, 5-hydroxylysine, and
3-hydroxyasparagine. This process is catalyzed by
iron-containing monooxygenases of the E.C. 1.14.16 subclass
[155, 156, 157] (Fig. 22).

Oxidized proline and lysine residues play an important role 
in the formation of hydrogen bonds in the tri-strand spatial 
structure of the connective tissue protein collagen. Oxidation 
takes place at the Pro-Gly and Lys-Gly sequences. 4-hydrox- 
yproline is found about 10 times more often than 3-hydroxy- 
proline [155-160]. 

In addition, hydroxylation of specific amino acid residues
plays a role in the function of the HIF transcription factor
(hypoxia-inducible factor) [156, 159-161]. This protein is
activated under conditions of insufficient oxygen. It
induces the transcription of a wide range of genes, including
the gene encoding erythropoietin, which stimulates erythrocyte
differentiation from precursor cells, thus increasing the
transport of oxygen to cells suffering from hypoxia [160].

The α-subunit of the human HIFαβ is posttranslationally
hydroxylated in the central region of the molecule at two
proline residues, Pro402 and Pro564, forming 4-OH-Pro, and also
in the C-terminal region at Asn803, forming 3-OH-Asn [156].
A molecule bearing hydroxylated proline residues is subjected
to ubiquitylation by the E3 ligase, and the lifespan of HIF
is determined by the rates of hydroxylation, ubiquitylation,
and proteolysis in the proteasomes. At high oxygen pressure,
the Pro-hydroxylase efficiently hydroxylates the Pro residues,
which increases the affinity towards the E3 ligase 1000-fold and
causes rapid ubiquitylation and degradation in the proteasomes.
At low O2 pressure, proline hydroxylation is slow, so HIF is
fairly stable and can exist for a long time [162, 163].

The hydroxylation of proline and asparagine side chains is
catalyzed by a family of oxygenases that contain non-heme
iron [163]. The active site of the enzyme (Fig. 23) contains two
histidines and one asparagine, which occupy three of the six
coordination sites around the Fe2+ atom; two sites are
occupied by the α-ketoglutarate co-substrate, and the sixth
by oxygen. The reaction of α-ketoglutarate with oxygen results
in oxidative decarboxylation and yields CO2 and succinate,
which accepts one of the oxygen atoms of the molecular
oxygen. The second oxygen atom takes part in the generation
of the high-valence Fe4+=O complex. The latter is an
effective oxidizing agent, which cleaves the unactivated C-H
bond at the C3 or C4 atom of proline, C5 of lysine, or C3 of
asparagine, thus forming •C-H and Fe3+-OH radicals.

Transfer of the hydroxyl radical •OH from Fe3+-OH to
•C-H results in the hydroxylation of the amino acid side
chain, which by itself is not a donor of electrons and does not
act as a nucleophile in this reaction. The monooxygenases that
catalyze hydroxylation reactions attach the hydroxyl radical
in a stereospecific manner.

POSTTRANSLATIONAL CARBOXYLATION 
OF THE GLUTAMIC ACID RESIDUE 

Most protein factors involved in blood clotting in
mammals contain several residues of γ-carboxyglutamic acid
(Gla). This residue appears in blood clotting factors as a result
of posttranslational modification, namely the fixation of CO2
at the γ-methylene carbon atom of glutamic acid (Glu) during
the factor's progress down the secretion pathways [164-166].
The Gla residue side chain, which bears two negatively
charged carboxyl groups, can form chelate complexes with
bivalent cations, which is especially important for
interaction with the Ca2+ ion [164].

Gla can be found in such proteins as prothrombin and
blood clotting factors IX and X, which are proenzymatic
forms of proteases [164]. Carboxylation of 10-12 Glu residues
in the N-terminal region of the proenzymes, within a sequence of
up to 40 amino acids, leads to the binding of several Ca2+ ions
and to a conformational change in the blood clotting factors,
which then associate on the surface of platelets adjacent to
the proteases that activate the factors by partial proteolysis
and initiate the blood clotting cascade [164-166].

Fig. 24. Vitamin K-dependent carboxylation of a glutamic acid
residue catalyzed by γ-glutamylcarboxylase. The 2,3-epoxide of
vitamin K is reduced by vitamin K 2,3-epoxide reductase

Fig. 25. Glycation of proteins in the presence of D-glucose.
The rectangles show the main precursors of AGEs, which are
formed during glycation

Carboxylation of the glutamic acid residue is catalyzed
by γ-glutamylcarboxylase (E.C. 1.14.99.20), which uses
the reduced (dihydronaphthoquinol) form of vitamin K (Fig.
24) [1, 164-166]. The oxidation of the reduced form of vitamin K
by oxygen results in the formation of a hydroperoxide adduct of
vitamin K, which forms a cyclic alkoxide anion, the 2,3-epoxide
of vitamin K, and generates a strong base, which abstracts a
proton from the γ-methylene carbon atom of glutamic acid. The
resulting carbanion attacks the carbon atom of CO2 and forms a
new C-C bond in the malonyl side chain of the Gla residue.
Reduction of the 2,3-epoxide of vitamin K into its original form
is catalyzed by 2,3-epoxide reductase (E.C. 1.1.4.1), which is
associated in a complex with protein disulfide isomerase
(E.C. 1.8.4.2) in the endoplasmic reticulum [167].

Fig. 26. Structure of certain AGEs formed as a result of in vivo
protein modification by D-glucose

NON-ENZYMATIC MODIFICATION 
OF FUNCTIONAL GROUPS IN PROTEINS 

PROTEIN GLYCATION 

Protein glycation is an endogenous non-enzymatic addition
of reducing sugars present in the bloodstream to the
side chains of lysine or arginine residues in proteins. A
schematic representation of the glycation process, which can
be divided into early and late stages, is shown in Fig. 25.
The first stage of glycation involves nucleophilic attack on
the glucose carbonyl group by the ε-amino group of lysine or the
guanidine moiety of arginine, which results in the formation
of a labile Schiff base, N-glycosylimine (1). The formation
of the Schiff base is a relatively rapid and reversible process
[168]. Next, the glycosylimine rearranges into an Amadori
product, 1-amino-1-deoxyfructose (2). This process is slower
than the formation of the glycosylimine but much faster than
the hydrolysis of the Schiff base. This
is why proteins bearing 1-amino-1-deoxyfructose residues
tend to accumulate in blood. Modification of lysine residues
at the early glycation steps is thought to be facilitated by the
close proximity of histidine or lysine residues, which catalyze
this process [169].

The late stage of glycation, which involves transformations
of the N-glycosylimine and the Amadori product, is
a slower and less studied process. It results in the formation
of stable advanced glycation end-products (AGEs)
(Fig. 26). There are published data [170] on the direct
involvement of α-dicarbonyl compounds (glyoxal (3),
methylglyoxal (4), and 3-deoxyglucosone (5)) in AGE formation.
These compounds form in vivo both during glucose degradation
and in the transformations of the Schiff base during
the modification of lysine residues in proteins by glucose
(Fig. 25).

46 | ACTA NATURAE | № 3 2009

Fig. 27. Formation of (a) green and (b) red chromophores in
proteins from tripeptides by intramolecular posttranslational
autocatalytic cyclization. Labels in panel (a): Ser65-Tyr66-Gly67;
"no fluorescence"; "green fluorophore". Labels in panel (b):
Gln66-Tyr67-Gly68; 1. cyclization, 2. dehydration, 3. oxidation,
4. O2; "red fluorophore".

Reactions between α-dicarbonyl compounds and the
ε-amino groups of lysine residues or the guanidinium groups
of arginine residues in proteins result in the formation of
protein crosslinks, which lead to the complications of protein
glycation seen in diabetes and other diseases. Moreover,
sequential dehydration of the Amadori product results in the
formation of a 1-amino-4-deoxy-2,3-dione (6) and an ene-dione (7)
at the C4 and C5 atoms, respectively (Fig. 25). These side chains
can form intra- and intermolecular protein crosslinks [170].

Some AGEs have been characterized, including
Nε-carboxymethyl-lysine (CML) and Nε-carboxyethyl-lysine
(CEL) [171], bis(lysyl)imidazole adducts (GOLD, MOLD, and
DOLD) [172], imidazolones (G-H, MG-H, and 3DG-H) [173, 174],
pyrraline [175], argpyrimidine [176], pentosidine [177],
crossline [178], and vesperlysine [179] (Fig. 26). Among these,
pentosidine, crossline, and vesperlysine are fluorophores, and
their fluorescence emission maximum (λem = 440 nm) is shifted
into the long-wave region compared to tryptophan residue
fluorescence in proteins [180]. This property of AGEs makes it
possible to monitor the progress of the glycation reaction by
measuring the fluorescence at the excitation wavelength
characteristic of the fluorescent glycation products
(glucophores) being formed.

INTRAMOLECULAR POSTTRANSLATIONAL
AUTOCATALYTIC CYCLIZATION

A very impressive type of posttranslational modification is
the autocatalytic restructuring of the peptide backbone in
the folded protein during GFP (green fluorescent protein)
maturation. This protein is encoded by a single gene, and
the chromophore is made up of three amino acid residues,
Ser65-Tyr66-Gly67, which undergo posttranslational
autocatalytic cyclization that does not require any cofactors or
substrates [181-183].

Formation of the chromophore requires that the precursor
take on the form of a β-barrel. This folded and colorless
GFP precursor bears the Ser65-Tyr66-Gly67 tripeptide in a
spatially strained conformation in which the amide of Gly-67
can attack the peptide carbonyl and form a five-membered
tetrahedral adduct (Fig. 27, a). This adduct is then dehydrated,
and the stable cyclic intermediate slowly autooxidizes,
forming a double bond conjugated with the phenol ring of
Tyr-66. This last oxidation reaction produces a chromophore
with an excitation maximum of 506 nm.

GFP is used as an in vivo vital marker, which allows the
study of various processes taking place in live cells and
organisms [184-186]. Fusion proteins based on GFP are used in
novel drug screening [187, 188], in apoptosis detection [189], in
the visualization of chromosome dynamics [190], and in many
other applications [191, 192]. Several volumes of Methods in
Enzymology [193] and Methods in Cell Biology [194] are
dedicated to GFP. The discovery of fluorescent genetic markers
was awarded the Nobel Prize in 2008.

During the last decade, the number of studies with other 
colored proteins similar to GFP but extracted from coral has 
been steadily growing [195-197]. A drawback of these pro- 
teins is their marked propensity to aggregate, which however 
can be rectified by mutagenesis [198]. A schematic represen- 
tation of the formation of a red fluorophore from the Gln66- 
Tyr67-Gly68 tripeptide in a protein molecule is shown in Fig. 
27, b. 
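The two chromophore-forming tripeptides named in this section can be located in a sequence with a simple scan. The sketch below uses a hypothetical function name and an arbitrary toy sequence; finding a tripeptide says nothing about whether a chromophore actually forms, since the text stresses that the folded barrel structure is required.

```python
# Tripeptides named in the text, in one-letter code.
CHROMOPHORE_TRIPEPTIDES = {
    "SYG": "green chromophore precursor (Ser-Tyr-Gly)",
    "QYG": "red chromophore precursor (Gln-Tyr-Gly)",
}


def find_chromophore_tripeptides(sequence: str) -> list[tuple[int, str, str]]:
    """Return (1-based start, tripeptide, label) for each match."""
    seq = sequence.upper()
    hits = []
    for start in range(len(seq) - 2):
        tripeptide = seq[start:start + 3]
        label = CHROMOPHORE_TRIPEPTIDES.get(tripeptide)
        if label is not None:
            hits.append((start + 1, tripeptide, label))
    return hits


# Arbitrary toy sequence, not a real protein.
toy_hits = find_chromophore_tripeptides("AAASYGAAQYGA")
```

For a real protein, the reported start position would be compared with the numbering in the text (Ser65 for GFP, Gln66 for the red protein).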

PROTEIN HOMOCYSTEINYLATION 

The majority of methylation processes in live organisms
use S-adenosylmethionine, thus forming S-adenosylhomocysteine.
The latter is hydrolyzed by the adenosylhomocysteinase
(E.C. 3.3.1.1) enzyme into adenosine and homocysteine.
Methionyl-tRNA synthetase (E.C. 6.1.1.10) converts homocysteine
into homocysteine thiolactone (this is a side reaction for this
enzyme) [199]. Homocysteine thiolactone is an
acylating agent and can react with the functional groups of
lysine residues [200-203]. The ε-amino group of lysine performs
a nucleophilic attack on the carbonyl carbon atom of
the thiolactone, which results in opening of the thiolactone
ring and the formation of an additional sulfhydryl moiety (Fig.
28).

This type of modification is characteristic of blood proteins
(albumin, hemoglobin, transferrin, and globulins) [204-207].
Ninety percent of the homocysteine in human blood plasma is
incorporated into N-homocysteinylated human serum albumin (HSA)
[201]. The main HSA homocysteinylation site both in vitro and
in vivo is known to be the Lys-525 residue [208]. Furthermore,
two additional albumin modification sites were discovered
at Lys-4 and Lys-12 [209].

Homocysteine can take part in disulfide exchange reac- 
tions with S-S bonds in proteins, thus forming S-homocystei- 
nylated proteins (Fig. 29) [200, 202, 206, 207, 210]. 

Homocysteinylation of proteins has a considerable effect
on their biological activity, including increased sensitivity
to oxidation and increased propensities for oligomerization,
denaturation, and sedimentation. The introduction of 8-9
homocysteine residues into methionyl-tRNA synthetase
and of 11-12 residues into trypsin completely inactivates these
proteins [207]. N-homocysteinylation of human serum albumin
considerably lowers its RNA-hydrolyzing activity [205].
Multiple homocysteinylation of cellular proteins can eventually
result in apoptosis [200, 201, 203, 206, 210].

DEAMIDATION AND TRANSAMIDATION 

One type of posttranslational modification that plays an
important role in cellular functions is the deamidation of the
amides of dicarboxylic acids. Many authors consider these
reactions to be a non-enzymatic cleavage of ammonia from the
amide group of asparagine or glutamine that proceeds through
an intermediate product, a cyclic imide (Fig. 30) [211-215].
The rate of formation of this intermediate is determined by
the neighboring amino acid residues and by the characteristics
of the solution (pH and composition) [213, 214]. Asparagine
residues in proteins are deamidated 40 times more often than
glutamine residues. Furthermore, the rate of asparagine
deamidation is 100-fold greater than the rate of glutamine
deamidation [214].
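
To make the 100-fold rate difference more tangible, the Python
sketch below treats deamidation of a single residue as a simple
first-order process and compares the fraction of intact
asparagine and glutamine residues over time. The first-order
model and the 10-day asparagine half-life are assumptions chosen
only for illustration; actual half-lives depend strongly on the
sequence and the solution conditions, as noted above.

```python
import math


def fraction_intact(half_life_days, t_days):
    """Fraction of amide residues not yet deamidated after t_days,
    assuming simple first-order decay."""
    k = math.log(2) / half_life_days  # first-order rate constant
    return math.exp(-k * t_days)


ASN_HALF_LIFE = 10.0                 # hypothetical value, in days
GLN_HALF_LIFE = 100 * ASN_HALF_LIFE  # 100-fold slower, as in the text

if __name__ == "__main__":
    for t in (1, 10, 100):
        asn = fraction_intact(ASN_HALF_LIFE, t)
        gln = fraction_intact(GLN_HALF_LIFE, t)
        print(f"day {t:>3}: Asn intact {asn:.3f}, Gln intact {gln:.3f}")
```

With these illustrative numbers, half of the asparagine residues
are deamidated after one asparagine half-life, whereas the
glutamine residues remain almost entirely intact.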

The cyclic imide is hydrolyzed to give either an aspartate
residue or an isoaspartate residue, in which the peptide bond
involves the β-carboxyl group of the aspartate side chain; the
isoaspartate residue typically predominates (approximately
3:1) [216, 217]. In the latter case, the polypeptide backbone
is lengthened by one methylene group (CH2), which can
influence the structure and functioning of the protein,
including its stability [214, 216, 217].

Deamidation reactions result in the formation of an ioniz- 
able carboxyl group charged negatively under physiological 
conditions, which alters the overall charge of the protein mol- 
ecule and its spatial structure [214]. 

The β-isopeptide bond of the isoaspartate residue is treated
by the cell as an aberration of the normal peptide bond, which
is formed by the α-amino and α-carboxyl groups of amino acids,
and is repaired by protein isoaspartyl O-methyltransferase
(PIMT) (E.C. 2.1.1.77), a widespread cellular enzyme [211,
212, 216]. Deamidation of Asn/Gln residues and a deficit of
PIMT are implicated in serious human illnesses, such as
cataract [218], Alzheimer's disease [219], autoimmune diseases
[220], and prion-dependent encephalopathy [214, 221, 222].

According to Robinson's hypothesis, the instability of the 
asparagine and glutamine residues in cellular proteins under 
physiological conditions determines a key biological function, 
which is a programmed biological clock mechanism limiting 
the lifespan of proteins and peptides [212, 223, 224]. 
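
The clock idea can be illustrated with a minimal Python sketch.
It assumes that each amide residue deamidates independently with
first-order kinetics, so that the rate constants of the
individual sites add up, and it computes the time after which
half of the protein molecules carry at least one deamidated
residue. The independence assumption, the function name, and the
per-site half-lives are hypothetical and serve only as an
illustration of the hypothesis summarized above.

```python
import math


def intact_half_life(site_half_lives):
    """Time after which half of the molecules have at least one
    deamidated site, treating sites as independent first-order
    processes (rate constants are additive)."""
    k_total = sum(math.log(2) / t for t in site_half_lives)
    return math.log(2) / k_total


# Hypothetical per-site half-lives in days (illustrative only).
SITES = [20.0, 50.0, 400.0]

if __name__ == "__main__":
    print(f"overall half-life: {intact_half_life(SITES):.1f} days")
```

In this picture the fastest sites dominate the overall timing, so
the sequence context of a few labile residues would effectively
set the clock for the whole protein.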

Deamidation, like ADP-ribosylation, can be caused by bacterial
toxins. Cytotoxic necrotizing factor 1 (CNF1) from Escherichia
coli and the dermonecrotic toxin (DNT) from Bordetella
deamidate small GTPases in human cells, such as RhoA (Gln63),
Rac1, and Cdc42 (Gln61), which blocks GTP hydrolysis and
disrupts the regulation of cytoskeleton remodeling [225-228].

Fig. 31. Transamidation catalyzed by transglutaminase (E.C. 2.3.2.13)

Deamidation is often coupled with subsequent transamidation
(the interaction of the ε-amino group of a lysine residue with
the side chain of a glutamine residue in the same protein
molecule), which is one of the types of cross-links
characteristic of posttranslational modification (Fig. 31)
[228-232].

This process leads to the formation of multiple bonds between
glutamine and lysine residues in protein molecules, which
results in a massive protein aggregate whose subunits are
cross-linked. This is an important process in the metabolism
of skin and hair and also during the healing of wounds [233].